NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SymbolFit: Automatic Parametric Modeling with Symbolic Regression

https://doi.org/10.1007/s41781-025-00140-9

Tsoi, Ho_Fung; Rankin, Dylan; Caillol, Cecile; Cranmer, Miles; Dasu, Sridhara; Duarte, Javier; Harris, Philip; Lipeles, Elliot; Loncar, Vladimir (July 2025, Computing and Software for Big Science)

Abstract We introduce SymbolFit (API: https://github.com/hftsoi/symbolfit), a framework that automates parametric modeling by using symbolic regression to perform a machine-search for functions that fit the data while simultaneously providing uncertainty estimates in a single run. Traditionally, constructing a parametric model to accurately describe binned data has been a manual and iterative process, requiring an adequate functional form to be determined before the fit can be performed. The main challenge arises when the appropriate functional forms cannot be derived from first principles, especially when there is no underlying true closed-form function for the distribution. In this work, we develop a framework that automates and streamlines the process by utilizing symbolic regression, a machine learning technique that explores a vast space of candidate functions without requiring a predefined functional form because the functional form itself is treated as a trainable parameter, making the process far more efficient and effortless than traditional regression methods. We demonstrate the framework in high-energy physics experiments at the CERN Large Hadron Collider (LHC) using five real proton-proton collision datasets from new physics searches, including background modeling in resonance searches for high-mass dijet, trijet, paired-dijet, diphoton, and dimuon events. We show that our framework can flexibly and efficiently generate a wide range of candidate functions that fit a nontrivial distribution well using a simple fit configuration that varies only by random seed, and that the same fit configuration, which defines a vast function space, can also be applied to distributions of different shapes, whereas achieving a comparable result with traditional methods would have required extensive manual effort.
more » « less
SymbolNet: neural symbolic regression with adaptive dynamic pruning for compression

https://doi.org/10.1088/2632-2153/adaad8

Tsoi, Ho_Fung; Loncar, Vladimir; Dasu, Sridhara; Harris, Philip (January 2025, Machine Learning: Science and Technology)

Abstract Compact symbolic expressions have been shown to be more efficient than neural network (NN) models in terms of resource consumption and inference speed when implemented on custom hardware such as field-programmable gate arrays (FPGAs), while maintaining comparable accuracy (Tsoiet al2024EPJ Web Conf.29509036). These capabilities are highly valuable in environments with stringent computational resource constraints, such as high-energy physics experiments at the CERN Large Hadron Collider. However, finding compact expressions for high-dimensional datasets remains challenging due to the inherent limitations of genetic programming (GP), the search algorithm of most symbolic regression (SR) methods. Contrary to GP, the NN approach to SR offers scalability to high-dimensional inputs and leverages gradient methods for faster equation searching. Common ways of constraining expression complexity often involve multistage pruning with fine-tuning, which can result in significant performance loss. In this work, we propose $S y m b o l N e t$ , a NN approach to SR specifically designed as a model compression technique, aimed at enabling low-latency inference for high-dimensional inputs on custom hardware such as FPGAs. This framework allows dynamic pruning of model weights, input features, and mathematical operators in a single training process, where both training loss and expression complexity are optimized simultaneously. We introduce a sparsity regularization term for each pruning type, which can adaptively adjust its strength, leading to convergence at a target sparsity ratio. Unlike most existing SR methods that struggle with datasets containing more than $O (10)$ inputs, we demonstrate the effectiveness of our model on the LHC jet tagging task (16 inputs), MNIST (784 inputs), and SVHN (3072 inputs).
more » « less
Reliable edge machine learning hardware for scientific applications

https://doi.org/10.1109/VTS60656.2024.10538639

Baldi, Tommaso; Campos, Javier; Hawks, Ben; Ngadiuba, Jennifer; Tran, Nhan; Diaz, Daniel; Duarte, Javier; Kastner, Ryan; Meza, Andres; Quinnan, Melissa; et al (April 2024, IEEE)

Full Text Available
Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml

https://doi.org/10.1088/2632-2153/acc0d7

Khoda, Elham E; Rankin, Dylan; Teixeira de Lima, Rafael; Harris, Philip; Hauck, Scott; Hsu, Shih-Chieh; Kagan, Michael; Loncar, Vladimir; Paikara, Chaitanya; Rao, Richa; et al (April 2023, Machine Learning: Science and Technology)

Abstract Recurrent neural networks have been shown to be effective architectures for many tasks in high energy physics, and thus have been widely adopted. Their use in low-latency environments has, however, been limited as a result of the difficulties of implementing recurrent architectures on field-programmable gate arrays (FPGAs). In this paper we present an implementation of two types of recurrent neural network layers—long short-term memory and gated recurrent unit—within the hls4ml framework. We demonstrate that our implementation is capable of producing effective designs for both small and large models, and can be customized to meet specific design requirements for inference latencies and FPGA resources. We show the performance and synthesized designs for multiple neural networks, many of which are trained specifically for jet identification tasks at the CERN Large Hadron Collider.
more » « less
Full Text Available
Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml

https://doi.org/10.1088/2632-2153/ac9cb5

Ghielmetti, Nicolò; Loncar, Vladimir; Pierini, Maurizio; Roed, Marcel; Summers, Sioni; Aarrestad, Thea; Petersson, Christoffer; Linander, Hampus; Ngadiuba, Jennifer; Lin, Kelvin; et al (November 2022, Machine Learning: Science and Technology)

Abstract In this paper, we investigate how field programmable gate arrays can serve as hardware accelerators for real-time semantic segmentation tasks relevant for autonomous driving. Considering compressed versions of the ENet convolutional neural network architecture, we demonstrate a fully-on-chip deployment with a latency of 4.9 ms per image, using less than 30% of the available resources on a Xilinx ZCU102 evaluation board. The latency is reduced to 3 ms per image when increasing the batch size to ten, corresponding to the use case where the autonomous vehicle receives inputs from multiple cameras simultaneously. We show, through aggressive filter reduction and heterogeneous quantization-aware training, and an optimized implementation of convolutional layers, that the power consumption and resource utilization can be significantly reduced while maintaining accuracy on the Cityscapes dataset.
more » « less
Full Text Available
QONNX: Representing Arbitrary-Precision Quantized Neural Networks

Pappalardo, Alessandro; Umuroglu, Yaman; Blott, Michaela; Mitrevski, Jovan; Hawks, Ben; Tran, Nhan; Loncar, Vladimir; Summers, Sioni; Borras, Hendrik; Muhizi, Jules; et al (January 2022, Fermi National Accelerator Lab)

Full Text Available
A Reconfigurable Neural Network ASIC for Detector Front-End Data Compression at the HL-LHC

https://doi.org/10.1109/TNS.2021.3087100

Guglielmo, Giuseppe Di; Fahim, Farah; Herwig, Christian; Valentin, Manuel Blanco; Duarte, Javier; Gingu, Cristian; Harris, Philip; Hirschauer, James; Kwok, Martin; Loncar, Vladimir; et al (August 2021, IEEE Transactions on Nuclear Science)
null (Ed.)
Full Text Available
Fast convolutional neural networks on FPGAs with hls4ml

https://doi.org/10.1088/2632-2153/ac0ea1

Aarrestad, Thea; Loncar, Vladimir; Ghielmetti, Nicolò; Pierini, Maurizio; Summers, Sioni; Ngadiuba, Jennifer; Petersson, Christoffer; Linander, Hampus; Iiyama, Yutaro; Di Guglielmo, Giuseppe; et al (July 2021, Machine Learning: Science and Technology)
null (Ed.)
Full Text Available
AIgean: An Open Framework for Machine Learning on Heterogeneous Clusters

https://doi.org/10.1109/FCCM48280.2020.00072

Tarafdar, Naif; Guglielmo, Giuseppe Di; Harris, Philip C; Krupa, Jeffrey D; Loncar, Vladimir; Rankin, Dylan S; Tran, Nhan; Wu, Zhenbin; Shen, Qianfeng; Chow, Paul (May 2020, FCCM conference proceedings)

Full Text Available
Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Heinz, Aneesh; Razavimaleki, Vasall; Duarte, Javier; DeZoort, Gage; Ojalvo, Isobel; Thais, Savannah; Atkinson, Markus; Neubauer, Mark; Gray, Lindsey; Jindariani, Sergo; et al (November 2020, ArXivorg)
null (Ed.)
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.
more » « less
Full Text Available

« Prev Next »

Search for: All records